Implementation of Mel-Frequency Cepstral Coefficient as Feature Extraction using K-Nearest Neighbor for Emotion Detection Based on Voice Intonation
نویسندگان
چکیده
Purpose: To determine emotions based on voice intonation by implementing MFCC as a feature extraction method and KNN an emotion detection method.Design/methodology/approach: In this study, the data used was downloaded from several video podcasts YouTube. Some of methods in study are pitch shifting for augmentation, audio data, basic statistics taking mean, median, min, max, standard deviation each coefficient, Min max scaler normalization process classification.Findings/result: Because testing is carried out separately gender, there two classification models. male model, highest accuracy obtained at 88.8% included good fit model. female 92.5%, but model unable to correctly classify new data. This condition called overfitting. After testing, cause because augmentation one tone women solve problem training size being too small not containing enough samples accurately represent all possible input values.Originality/value/state art: The research has never been previous studies downloading Youtube then processed until ready be research.
منابع مشابه
Recognition Of Voice Using Mel Cepstral Coefficient & Vector Quantization
Human Voice is characteristic for an individual. The ability to recognize the speaker by his/her voice can be a valuable biometric tool with enormous commercial as well as academic potential. Commercially, it can be utilized for ensuring secure access to any system. Academically, it can shed light on the speech processing abilities of the brain as well as speech mechanism. In fact, this feature...
متن کاملA Vector Quantization Approach for Voice Recognition Using Mel Frequency Cepstral Coefficient (MFCC): A Review
This paper presents a brief survey on Automatic Voice Recognition so as to provide a technological perspective and an appreciation of the fundamental progress that has been accomplished in area of voice communication. The voice is a signal of infinite information. After years of research and development the accuracy of automatic voice recognition remains one of the important research challenges...
متن کاملMel-frequency cepstral coefficient-based bandwidth extension of narrowband speech
We present a novel MFCC-based scheme for the Bandwidth Extension (BWE) of narrowband speech. BWE is based on the assumption that narrowband speech (0.3–3.4 kHz) correlates closely with the highband signal (3.4–7 kHz), enabling estimation of the highband frequency content given the narrow band. While BWE schemes have traditionally used LP-based parametrizations, our recent work has shown that MF...
متن کاملNearest Neighbor For Histogram-based Feature Extraction
Manual grading process of Fresh Fruit Bunch (FFB) leads to misconduct and human error while inspecting the right category of fruits for the purpose of oil palm production at the mill. It is extremely important to identify the degree of ripeness of FFB are at 95% level of confidence as mentioned by Malaysian Palm Oil Board (MPOB). Therefore, wrong evaluation of graded fruits will result wrong re...
متن کاملVoice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques
Digital processing of speech signal and voice recognition algorithm is very important for fast and accurate automatic voice recognition technology. The voice is a signal of infinite information. A direct analysis and synthesizing the complex voice signal is due to too much information contained in the signal. Therefore the digital signal processes such as Feature Extraction and Feature Matching...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Telematika: Jurnal Informatika Telekomunikasi Komputasi Elektronika dan Industri
سال: 2023
ISSN: ['2460-9021', '1829-667X']
DOI: https://doi.org/10.31315/telematika.v20i1.9518